Classi er Selection using the Predicate Depth

نویسنده

  • Ran Gilad-Bachrach
چکیده

Typically, one approaches a supervised machine learning problem by writing down an objective function and nding a hypothesis that minimizes it. This is equivalent to nding the Maximum A Posteriori (MAP) hypothesis for a Boltzmann distribution. However, MAP is not a robust statistic. We present an alternative approach by de ning a median of the distribution, which we show is both more robust, and has good generalization guarantees. We present algorithms to approximate this median. One contribution of this work is an e cient method for approximating the Tukey median. The Tukey median, which is often used for data visualization and outlier detection, is a special case of the family of medians we de ne: however, computing it exactly is exponentially slow in the dimension. Our algorithm approximates such medians in polynomial time while making weaker assumptions than those required by previous work.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Sequential selection of discrete features for neural networks - A Bayesian approach to building a cascade

A feature selection procedure is used to successively remove features one-by-one from a statistical classi®er by an iterative backward search. Each classi®er uses a smaller subset of features than the classi®er in the previous iteration. The classi®ers are subsequently combined into a cascade. Each classi®er in the cascade should classify cases to which a reliable class label can be assigned. O...

متن کامل

Adaptive Selection of Image Classifiers

Recently, the concept of \Multiple Classi er Systems" was proposed as a new approach to the development of high performance image classi cation systems. Multiple Classi er Systems can be used to improve classi cation accuracy by combining the outputs of classi ers making \uncorrelated" errors. Unfortunately, in real image recognition problems, it may be very di cult to design an ensemble of cla...

متن کامل

Feature selection using Fuzzy Entropy measures with Yu ' s Similarity measure

In this study, feature selection in classi cation based problems is highlighted. The role of feature selection methods is to select important features by discarding redundant and irrelevant features in the data set, we investigated this case by using fuzzy entropy measures. We developed fuzzy entropy based feature selection method using Yu's similarity and test this using similarity classi er. ...

متن کامل

Navigala: an Original Symbol Classifier Based on Navigation through a Galois Lattice

This paper deals with a supervised classi ̄cation method, using Galois Lattices based on a navigation-based strategy. Coming from the ̄eld of data mining techniques, most literature on the subject using Galois lattices relies on selection-based strategies, which consists of selecting/ choosing the concepts which encode the most relevant information from the huge amount of available data. Generall...

متن کامل

Ensembles as a Sequence of Classifiers

Lars Asker Richard Maclin Jet Propulsion Laboratory Department of Computer Science M/S 525-3660 University of Minnesota Pasadena, California 91109-8099 Duluth, Minnesota 55812-2496 Abstract An ensemble is a classi er created by combining the predictions of multiple component classi ers. We present a new method for combining classi ers into an ensemble based on a simple estimation of each classi...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2013